Skip to content

feat: Document Extract: 1) Support extracting images from PDF file. 2) Output the extracted image list#5277

Open
wangliang181230 wants to merge 1 commit into
1Panel-dev:v2from
wangliang181230:PR/15-feat-DocumentExtract-parseImageInPDF
Open

feat: Document Extract: 1) Support extracting images from PDF file. 2) Output the extracted image list#5277
wangliang181230 wants to merge 1 commit into
1Panel-dev:v2from
wangliang181230:PR/15-feat-DocumentExtract-parseImageInPDF

Conversation

@wangliang181230
Copy link
Copy Markdown
Contributor

@wangliang181230 wangliang181230 commented May 20, 2026

feat: Document Extract node:

  1. Support extracting images from PDF file.
  2. Output the extracted image list.

新特性: 文档内容提取 节点:

  1. 支持提取PDF文件中的图片了。
  2. 输出从Word、PDF等类型的文档中提取到的图片列表(image_list),可放到 图片理解 节点中进行解读,如图:
    1. 流程节点截图:
      图片
    2. 文档内容提取PDF效果截图:
      图片
    3. 图片理解入参截图:
      图片
    4. 图片理解回答效果截图:
      图片

@f2c-ci-robot
Copy link
Copy Markdown

f2c-ci-robot Bot commented May 20, 2026

Adding the "do-not-merge/release-note-label-needed" label because no release-note block was detected, please follow our release note process to remove it.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@f2c-ci-robot
Copy link
Copy Markdown

f2c-ci-robot Bot commented May 20, 2026

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@wangliang181230 wangliang181230 force-pushed the PR/15-feat-DocumentExtract-parseImageInPDF branch 4 times, most recently from ccbc4fb to cefa888 Compare May 20, 2026 18:43
@wangliang181230 wangliang181230 changed the title feat: Document Extract supports parse the image in PDF file feat: Document Extract: 1) supports parse the image in PDF file. 2) Return the document image list May 20, 2026
@wangliang181230 wangliang181230 changed the title feat: Document Extract: 1) supports parse the image in PDF file. 2) Return the document image list feat: Document Extract: 1) Support extracting images from PDF file. 2) Return the image list May 21, 2026
@wangliang181230 wangliang181230 force-pushed the PR/15-feat-DocumentExtract-parseImageInPDF branch from 11ae13c to 0acd1ed Compare May 21, 2026 06:25
@wangliang181230 wangliang181230 changed the title feat: Document Extract: 1) Support extracting images from PDF file. 2) Return the image list feat: Document Extract: 1) Support extracting images from PDF file. 2) Output the extracted image list May 21, 2026
@wangliang181230 wangliang181230 force-pushed the PR/15-feat-DocumentExtract-parseImageInPDF branch from ce6fe3f to 52e9717 Compare May 22, 2026 10:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant